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inherited neurodegenerative diseases: 



Several dominantly inherited, late onset, neurodegenerative 
diseases are due to expansion of CAG repeats, leading to 
expansion of glutamine repeats in the affected proteins. These 
proteins are of very different sizes and, with one exception, 
show no sequence homology to known proteins or to each 
other; their functions are unknown. In some, the glutamine 
repeat starts near the N-terminus, in another near the middle 
and in another near the C-terminus, but regardless of these 
differences, no disease has been observed in individuals 
with fewer than 37 repeats, and absence of disease has 
never been found in those with more than 41 repeats. 
Protein constructs with more than 41 repeats are toxic to 
E coli and to CHO cells in culture, and they elicit ataxia 
in transgenic mice. These observations argue in favour of 
a distinct change of structure associated with elongation 
beyond 37-41 glutamine repeats. The review describes 
experiments designed to find out what these structures 
might be and how they could influence the properties of 
the proteins of which they form part. PoIy-L-glutamines form 
pleated sheets of (J-strands held together by hydrogen bonds 
between their amides. Incorporation of glutamine repeats 
into a small protein of known structure made it associate 
irreversibly into oligomers. That association took place during 
the folding of the protein molecules and led to their becoming 
firmly interlocked by either strand- or domain-swapping. 
Thermodynamic considerations suggest that elongation 
of glutamine repeats beyond a certain length may lead to 
a phase change from random coils to hydrogen- bonded 
hairpins. Possible mechanisms of expansion of CAG repeats 
are discussed in the light of looped DNA model structures. 
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,chymotrypsin inhibrtor-2 
dentatorubral-pallidoluysian atrophy 
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glutathione-S-transferase 
Huntington's disease 
isopropylthio-D-galactoside 
Machado-Joseph disease 
spinal and bulbar muscular dystrophy 
spinocerebellar atrophy 
sodium dodecylsulfate 
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DRPU 
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GST 
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IPTG 
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SBMA 
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SDS 



Introduction 

March 1993 saw the discovery of the gene for Huntington's 

disease (HD). a dominnnrlv inherired. imrosomnl. sevrrr 



neurodegenerative disease which generally sets in in 
middle age. The gene consisted of 67 exons spread over 
180 kb of DNA; it contained an open reading frame for a 
single polypeptide chain of over 3100 amino acids. Ex< . 
1 contained a long repeat of CAGs, coding for glutamines, 
followed by long repeats of triplets, mostly CCGs, but 
interrupted by CCAs and CTTs, coding for prolines. The 
only difference between the normal genes and those found 
in patients with Huntington's disease consisted in the 
numbers of CAG repeats: no case of HD has been reported 
among people with fewer than 37 glutamine repeats, nor 
has any case with more than 41 glutamine repeats been 
found free from HD. The severity of the disease increased 
and the age of onset diminished with the number of CAG 
repeats [1 ## ]. Earlier, expanded triplet repeats had been 
recognised as the causes of three other inherited diseases: 
myotonic dystrophy [2^], Fragile X syndrome [5,6], and 
spinal and bulbar muscular atrophy (SBMA), an X-linked 
syndrome also known as Kennedy disease [7]. The first 
two are due, respectively, to expansion of CTG and CGG 
triplets in untranslated regions of the genome, but SBMA 
was found to arise by an expansion of CAG repeats coding 
for a repeat of glutamines in the androgen receptor. 

Since 1993, several other dominantly inherited, late onset, 
neurodegenerative diseases have been discovered to be 
due to expanded glutamine repeats in the affected 
proteins. Each of them specifically affects different 
neurones and gives rise to different symptoms. They are 
Machado-Joseph disease, also known as Spinocerebellar 
Ataxia Type 3 (MJD or SCA3) [8]; Dentatorubral-palli- 
doluysian atrophy (DRPLA) [9,10], and Spinocerebellar 
ataxia type 1 (SCA1) [11]. Ail three diseases show 
autosomal dominant inheritance. An antibody specific 
for expanded glutamine repeats has recently led to the 
detection of two more autosomal and dominantly inherited 
cerebellar ataxias, named types 2 and 7 [12], In a screen 
of 100128 brain cDNAs for polymorphic CAG and CTG 
repeats by hybridisation with different primer sequences, 
Ncri et a/, found three clones of highly polymorphic 
repeats which could play a part in other neurodegenerative 
diseases or multifactorial neuropsychiatric disorders [13]. 
One strange feature common to all these diseases is 
'anticipation* by which is meant earlier onset and increased 
severity in successive generations. This has now been 
explained by the expansion of the CAG repeats on the 
mutant chromosomes, especially in male transmission. 

These remarkable discoveries posed a great challenge to 
biomedical research. What is the molecular mechanism 
of CAG expansion? Can it be prevented? What is the 
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How docs expansion affect them? Why is it toxic to 
specific neurons in the central nervous system? Can the 
toxic effects of expanded glutamine repeats be prevented 
or at least alleviated? 

Since 1993 a large literature on these diseases has grown 
up. In this review I have tried to assemble some of the 
information that has emerged and to give an account of 
my colleagues' and my own approach to the problem from 
the protein-structural point of view; 



Huntington's disease 

Several observations show that HD is due to a gain rather 
than a loss of function. For example, patients with a 
chromosomal translocation which inactivates one of their 
HD genes exhibited no neurological symptoms, nor do 
mice in which one of these genes has been knocked out. 

The function of the HD protein, now called huntingtin, is 
unknown, and so is the function of its glutamine repeat 
That repeat may not even have a function, because it 
is reduced to seven glutamines in mice [14] and to only 
fow in the puffer fish (Fugu rubripes) [15*]. Yet huntingtin 
itself must have a vital function, because *knock-out* 
mice which lack the HD gene die as 8.5 to 10.5 day-old 
embryos [16,17]. This function must be preserved even 
on expansion of the CAG repeat, because patients homo- 
zygous for HD grow up normally and exhibit symptoms 
similar to those of hetcrozygotes. The gene is expressed 
in varying degrees in all tissues. Immunostaining of rat 
brain tissue showed much of the stain to be concentrated 
in presynaptic terminals. After subcellular fractionation 
in sucrose density gradients, huntingtin was enriched in 
a synaptosomal fraction. Its immuno-reactivity dropped 
with washing with 0.2 M or 1 M NaCI, suggestive of loose 
association with membranes. Another study of normal and 
HD-afTected human brains also found huntingtin to be 
membrane-bound, the strongest signal again coming from 
the synaptosomal fraction [18-22]. 

Several studies have shown huntingtin to be present only 
in the cytoplasm, but a study of mouse embryonic and 
human fibroblasts and mouse neuroblastoma cells showed 
that an antibody directed against the C-terminal domain 
of huntingtin stained both nuclei and cytoplasm, but not 
nucleoli. Cytoplasmic and nuclear protein fractions sepa- 
rated by SDS-PAGE and detected on Western blots also 
showed huntingtin in both fractions. The authors attribute 
the failure of others to find huntingtin in the nucleus 
to differences in antibodies and experimental procedures 
123]. The 'question of its location is important, because 
most proteins with glutamine repeats are transcription 
factors, and a nuclear presence opens the possibility that 
huntingtin might act as a transcription factor-associated 
protein, even if it lacks an obvious nucleic acid binding 
site. 
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The incidence of HD varies widely among ethnic groups, 
being lowest among South African Blacks (1 in 105) and 
Japanese (0.11-0.45 in 105), and 4 in 105 on average 
among Europeans, though varying widely in different 
countries and districts within countries [24]. Rubinsztcin 
etal. [25] have shown that the distribution of CAG repeats 
in normal populations is not random, but skewed towards 
lengths above the most frequent length; human CAG 
repeats are longer than those of primates (7-12 repeats). 
There is no evidence that longer repeats in humans 
carry any selective advantage. Rubinsztein et at, therefore 
tried to reproduce the skewed human distribution by a 
computer model based on random mutations and neutral 
genetic drift, with a mutational bias towards longer repeats. 
Their model predicts a mutational bias towards longer 
alleles over all repeat lengths and an increase of the 
mutation rate with length, consistent with the finding that 
new HD mutations originate on chromosomes with high 
normal repeat numbers. These findings led the authors to 
the prediction that, over an evolutionary time scale, the 
lengths of CAG repeats and hence the incidence of HD 
in human populations is likely to increase. 



Machado-Joseph disease (MJD or SCA3) 

Machado-Joseph disease is one of the more common 
inherited spinocerebellar diseases. It is due to expansion 
of the glutamine repeat in a protein of only about 359 
amino acid residues [8]. The repeat lies near the carboxyl 
end and varies from 13 to 36 glutamines in normal 
individuals compared to 61 to 84 in affected patients. The 
remainder of the sequence shows no obvious homology 
to known proteins and contains no nucleic acid binding 
site or transmembrane helices [26]. Introduction into CHO 
cells of the expanded repeats linked to the C-terminal 
residues alone gradually killed the cells, while introduction 
of the full-length MJD protein with either normal or 
expanded repeats, or of the normal-length repeat with the 
C-terminal residues alone had no effect. On introduction 
of the same constructs into transgenic mice they were 
expressed only in Purkinje cells. The expanded repeat 
with the C-terminal residues alone gave rise to ataxia, gait 
disturbances and inability to rear, while the constructs with 
normal repeats had no effect [27]. 

Western blots of the expressed proteins showed signals at 
the expected molecular weights for all except the one with 
the expanded repeat attached to the C-terminal residues. 
This had higher molecular weights which persisted 
even after boiling in 2% Na-dodecylsulfate. The authors 
attribute these to covalent cross-linking of the peptides, 
but Kelvin Stott and I have found oligo-L-glutamines to 
form precipitates insoluble in this detergent without any 
covalent cross-linking (K Stott, M Perutz, unpublished 
data). The hydrogen bonds between the amide groups 
alone seem to be sufficient to make these insoluble. 
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Dentatorubral-pallidoluysian atrophy (DRPLA) 

The normal DRPLA gene encodes a protein of about 1 184 
amino acid residues, with a repeat of 7 to 23 glutamines in 
normal individuals and expansion from 49 to 75 in affected 
patients [10). The repeat begins at residue 484, preceded 
by five histidines. The protein also contains the polar 
zipper sequence REREKEARERDLRDR (using amino 
acid single-letter code), plus several serine repeats and 
segments rich in serines and prolines, but unfortunately 
we lack molecular structures to tell us what these signify. 
RD repeats figure prominently in spliceosomes, which 
makes the authors suggest that the DRPLA protein may 
bind RNA. In the human brain the protein is found in 
the membrane-rich fraction of the neuronal cytoplasm, the 
same that also contains huntingtin. Rat DRPLA protein 
has a repeat of only five consecutive glutamines, followed 
by PQPQPQPQQ [28]. 

Spinal and bulbar muscular atrophy (SBMA) 
or Kennedy's disease 

This X-Iinked motor neurone disease is due to expansion 
of the glutaminc repeat in the androgen receptor, a 
transcription factor of about 918 residues. The repeats 
begin at residue 59, while the DNA binding site is in 
the C-terminal domain. The largest normal allele has 36 
glutamines and the smallest diseased one 40 [29]. Males 
who have point mutations in the androgen receptor gene 
exhibit feminisation, but they do not get motor neurone 
disease. This is convincing proof that Kennedy disease 
is due to gain rather than loss of function. On the other 
hand, males with Kennedy's disease also exhibit mild 
feminisation. 

Kazemi-Esfarjani, Trifior and Pinsky have found the cause 
of this feminisation [30], They constructed an androgen 
receptor-regulated reporter gene and measured its trans- 
activation by an androgen analogue as a function of the 
length of the CAG repeat in a human androgen receptor 
cDNA expression vector. They found transactivation to 
be maximal when all CAGs had been deleted and to 
decrease steadily with the number of CAG repeats. 
They suggest that the glutaminc repeats allow another, 
inhibiting protein to bind to the receptor which modulates 
its activity, but I wonder whether the inhibition is due 
simply to increasing dimerisation or oligomcrisation of the 
receptor itself. 

Spinocerebellar ataxia type 1 (SCA1) 

SCA1 is caused by expansion of a glutaminc repeat in 
a protein of about 810 residues. The repeat starts at 
residue 197. The normal number of glutamines ranges 
from 19 to 36 and the expanded one from 43 to 81 [31], 
In 123 of 126 unexpanded alleles of the SCA1 gene, 
the CAG repeat was interrupted by at least one CAA 
triplet, while all expanded alleles from seven different 
pedigrees had uninterrupted CAG repeats [32J. The 
largest uninterrupted normal repeat contained 21 CAGs. 
While this was stable, 70% of transmissions of longer 



uninterrupted repeats were accompanied by changes in 
length. Similarly in fragile X-syndrome, which is due 
to expansion of non-coding CCG repeats, the repeats 
in normal individuals are interrupted by a single base. 
The amino acid sequence of the SCA protein shows 
no homology with known proteins. As in HD, both the 
normal and the expanded gene are expressed in affected 
individuals. In non-neuronal cells the protein is found only 
in the cytoplasm. In Purkinje cells of the cerebellum it 
is found in both nuclei and cytoplasm, while in other 
neuronal cells it is found only in the nucleus [33]. The 
corresponding protein in mice and rats contains only two 
glutamines. Transgenic mice expressing the human SCA1 
gene with 30 CAG repeats had normal Purkinje cells and 
normal behaviour, while those expressing the gene with 
82 repeats developed ataxia and degeneration of Purkinje 
cells [34], 

The amino acid sequences of the MJD, DRPLA and SCA1 
proteins show no indication of cither nucleic acid binding 
sites or transmembrane helices. 

Table 1 shows that the proteins containing the glutamine 
repeats are of very different sizes; in two of them the 
repeats are near the N-termini, in two nearer the middle 
and in the fifth the repeat lies near the C-tcrminus. 
The most striking common feature is that the number 
of repeats in normal individuals docs not exceed 41. On 
the other hand, several cases of HD with only 37 repeats 
have been found. They have been attributed to 'greater 
penetrance' of the gene, but this is no more than a 
learned name for the enigmatic overlap between normal 
and pathological lengths of the glutamine repeats [35]. 

Molecular mechanism of CAG expansion 

Deletion or addition of CAG repeats could happen during 
replication of DNA by either frameshifts (slippage) or 
recombination. Streisinger and Owen first predicted th:u 
frameshift mutations are likely to occur at repetitive 
sequences, additions by looping-out of the daughter strand 
and deletions by looping out of the parent strand (Fig. 1) 
[36]. Their prediction was confirmed by the nucleotide 
sequences at mutational hot-spots in the lac I gene of 
E. cols; they occurred at tandemly repeated sequences 
such as CTGGCTGGCTGG, which generated additions 
and deletions of the repeated tetranucleotide at a rate 
high compared to other mutational sites (0.5-2 x 10 6 ) [37]. 
Replication of double-stranded CAG/CTG triplet repeats 
in vitro was found to be interrupted by pauses which 
became more marked the longer the repeats [38], The 
location of the pauses was determined by the distance 
between the site of hybridization with the primer and 
the start of the triplet repeat. They were abolished by 
heating at 70*C, which is known to melt hairpins and other 
secondary structure. The pauses may be due to loops or 
hairpins of the kind predicted by Streisinger and Owen. 
They may block the DNA polymerase's progress, forcing 
it to idle and to insert addirional triplets while doing so. 
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Table 1 



Characteristics of glutamine repeats in proteins affected by inherited neurogenerative diseases. 



Disease 



Approximate number of amino acid 
residues in normal protein 



Start of glutamine repeat 
at residue number 



Longest normal 
repeat 



Shortest repeat 
in disease 



HD 

SMBA 

3CA1 

DRPLA 

MJD 



3144 
918 
810 
1184 

359 



18 

59 
197 
484 
297 



41 

36 
39 
39 
36 



37' 
40 
40 
53 
62 



•The CAG repeat of the huntingtin gene is followed by the triplets CAA and CAG. Therefore, the number of glutamines exceeds by two the number 
of uninterrupted CAG repeats. 



Figure 1 



Intermediates in the formation of 
frameshift mutations in double helical 
DNA. (a) Concatameric T4 DNA 
molecule during intracellular growth, 
after endonucleorytic cleavage. »s and 
vertical lines represent a sequence of 
reiterated base pairs, (b) Formation of 
an end. (c) Single-stranded exonuclease 
digestion, (d) Melting of a short terminal 
stretch. (e,f) Annealing in a misaligned 
position, (g) Precursor to an addition 
mutation, (h) Precursor to a deletion 
mutation. In each diagram the bottom 
line represents the template and the top 
line the daughter strand. Note that if the 
extrahelical base occurs in the daughter 
strand (e) an addition will be generated, 
while the bulge defect in the parent 
strand produces a deletion. Reproduced 
with permission from [36], 
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So far as I am aware, no-one has yet demonstrated looping- 
out of single-stranded hairpins from double-stranded 
CAG/CTG in vitro, but two groups have studied the 
conformations of single-stranded triplet repeats [39,40]. 
(CTG)j5_25 an d (CAG)i5_25 do indeed form stable 
hairpins in solution, and they are more stable than hairpins 
formed by other triplet repeats. Interruptions of the 
regular repeats reduce the stability of the hairpins. This 
is interesting, seeing that interruption of the CAG or 
CCG repeats by a single base seems to prevent their fatal 
expansion in humans. 



A little more about the loops can be learnt from Zhu, 
Chou and Reid's [41] detailed NMR structure of the DNA 
decamer CAATGCAAATG. It is centred on the triplet 
GCA, which is related to repeated CAG triplets by a 
simple frameshift. This decamer forms a hairpin joined 
at its apex by the single cytidine and held together by 
interactions between a sheared GA pair (Fig. 2). The 
authors also studied the structure of the three other GNA 
triplets inserted into decamers with the same flanking 
sequences. CGA formed stable antiparallel duplexes while 
GAA and GTA formed equilibrium mixtures of duplexes 




and hairpins; GCA was the only triplet which formed loops 
made of a single nucleotide (C) held together by a GA pair 
The heptadecamcr A(GCA) 5 G migrated in solution purely 
as a hairpin. NMR spectra suggested that the hairpins 
were held together by either G C or anti-anti G A pairs 
Short (CTG) n repeats of the strand complementary to 
(CAG)„ also formed hairpins. The authors suggest that 
looping-out of both DNA strands could lead to Holliday 



junctions which play a part in meiotic gene transposition 
and recombination (Fig. 3). 

In hairpins formed by single-stranded CAG or CTG triplet 
repeats, every third base pair is mismatched, so that they 
would always be less stable than CAG/CTG double helices 
in which all base pairs are linked by complementary 
hydrogen bonds. It seems likely, therefore, that hairpins 




Stereo view of the refined 
CAATGCAATG hairpin structure. The 
stem consists of four basepairs with a 
sheared GA pair at each end. Note that 
6C stacks nicely on 5G of the 5G7A 
pair with its sugar stacked on the 7A 
base. At the open end of the stem, the 
5'-overhang 1C stacks similarly on the 
sheared 10G2A pair which has greater 
propeller-twist (perhaps due to end 
effects). Reproduced with permission 
from [41]. 
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Figure 3 



A model of homologous recombination 
proposed by Robin Holliday (60]. One 
parental duplex is shown in black and 
dark grey, and the other in white and pale 
grey. Either outer strand can base-pair 
with either inner strand, and vice versa. 
X, Y, and 2 denote three genes; x, y, 
and z are alleles. The steps are (a) 
alignment, (b) cleavage of one strand 
of each duplex, (c) invasion, (d) sealing, 
and (e) branch migration. Part (f) is an 
alternative representation of part (e). 
The lower duplex of (e) is rotated 180 
degrees about the verticaJ axis to give 
(0. Note that (0 can be cleaved along 
the (g) horizontal or the (h) vertical axis, 
which corresponds to the cleavage of 
the invading or the non-invading strands, 
respectively. Rejoining of strands gives 
two different sets of recombinants 
(I) and Q). Regions containing one 
strand from each parental duplex, called 
heteroduplexes, are labeled with an 
asterisk. After H Potter and O Dressier 
[61]. Reproduced with permission from 
[62]. 
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arise only when the double helix is unwound before 
replication, or during recombination. 

Toxicity of long polyglutamine repeats in E. 
coli and mice 

Odoneral et al [42] have constructed fusion proteins of 
glutathione-S transferase (GST) with glutamine repeats 
of various lengths whose expression in E. coli could 
be induced by isopropylthio-D-galactoside (IPTG). They 
found that addition of IPTG to their cultures increased 
the expression of GST-fusion proteins with 10 to 35 
glutamines, but decreased expression of GST-fusion 
proteins with 59 to 81 glutamines. Suspecting inhibition 
of cell growth, the authors compared the growth rates of 
£. coli containing constructs without the polyglutamine 
domain with those containing polyglutamine domains of 
various length. Before induction with IPTG all these 
cultures grew at the same rates, but after induction 
constructs with more than 35 glutamines showed reduced 
growth rates (Fig. 4). On the other hand, GST-fusion 
proteins with the equivalent poly-L-alanines had no effect. 

Mangiarini etal [43] have bred transgenic mice expressing 
exon 1 of the human HD gene with different lengths of 
CAG repeats. This exon coded for the 17 mixed amino- 
terminal residues, for the glutamine repeat, and finally 
the two long proline repeats plus the following twelve 
mixed residues. The authors constructed transgenic lines 
carrying expansions of 116-156 repeats. In the three 
lines in which the transgene was expressed, its average 
expression was 75%, 31% and 77% of the endogenous HD 
gene. These mice developed a progressive neurological 
phenotype. In one line, the symptoms developed at 
9-11 weeks, and the mice died at between 10 and 
13 weeks. Their symptoms included z resting tremor, 
involuntary and sudden movements resembling chorea, 
and epileptic seizures induced by handling. The only 
detectable neuropathology was an average reduction in 
brain size of 19%, due to a reduction of all brain structures; 
this resembled early changes associated with HD in 
humans. It might be argued that the long proline rather 
than the glutamine repeat produced the symptoms, but 
this seems unlikely, because that repeat is polymorphic in 
the mouse, and one allele has a repeat almost as long as 
the human one. Control mice transfected with an exon 1 
containing a repeat of only 18 CAGs are five months old 
at the time of writing and show no symptoms. 

Interaction of huntingtin with other proteins 

Li et al. [44] searched for proteins with which the rat and 
human HD proteins associate. They screened a rat brain 
DNA library by ligating a cDNA encoding the first 230 
amino icids of the HD protein to a Gal-4 DNA binding 
domain plasmid vector of the yeast two-hybrid system and 
found one positive clone among 100,000 colonics. It had an 
insert of 600 base pairs encoding part of a new hydrophilic 
protein which associated with the human HD protein. 
Expansion of the glutamine repeat of the HD protein 
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Impairment of growth of £ coli expressing GST*polyglutamine fusion 
proteins. £ coli transformed with clones Q(-) (T); Q10 (■); Q19 
Q35 (O); Q59 (A); Q62 Q; or Q81 {•) were incubated in (a) the 
absence or (b) the presence of 0.1 mMIPTG or measure of bacterial 
concentration in the culture medium. Reproduced with permission 
from 142]. 



enhanced the association. The full protein of more than 
600 amino acid residues contains a repeat of glutamates 
which would associate with glutamine repeats even more 
strongly than glutamine repeats would associate with 
themselves, because hydrogen bonds between amides and 
carboxylates are stronger than hydrogen bonds between 
amides. The protein is enriched also in human brain and 
appears low or absent from other tissues. 

Burke et al. [45] incubated normal human brain homogenate 
with polyglutamine-20 and polyglutamine-60 immobilized 
on resin beads. A protein that was retained only on 
the polyglutamine-60 beads was identified as the ubiq- 
uitous enzyme glyceraldehydc phosphate dehydrogenase 
(GAPDH). In the converse experiment GAPDH immo- 
bilized on agarose beads through its cc-amino group was 
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the DRPLA protein. An antibody was raised against the 
ammo-terminal domain of huntingtin which includes the 
glutamme repeat. This antibody bound several fragments 
of huntingtin which also bound to GAPDH The authors 
suggest that binding of GAPDH by long glutaminc repeats 
m huntingtin or the DRPLA protein may damage neurons 
by interfering with metabolic pathways requiring that 
enzyme, but they admit that such a mechanism fails to 
explain the destruction of only specific cells by proteins 
with polyglutamine repeats. Koshy ttal. [46] found similar 
interactions of GAPDH with the SCA1 protein and the 
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Goldberg e, al. [47] translated truncated cDNAs of the 
HD protein which included the CAG repeat and studied 
its cleavage by a cysteine proteinase active in apoptotic 
cel. death. They found that this enzyme does deavc 
huntingtin both /,/ vitro and in cultured mammalian cell 
mes and that the cleavage rate rises with the length of 
the glutamme repeat. On the other hand, the increase in 
the percentage cleaved after three hours' incubation in 
going from a repeat of 15 glutamines to one of 91 was only 
1.3 fold. It may therefore be premature to suggest, as the 
authors have done, that HD could be due to accelerated 
apoptotic cleavage of huntingtin in the affected neurons. 

Using the yeast two hybrid system, huntingtin has 
been found to interact with human ubiquicin-conjugating 

uSTE T at , taChCS C ■ termina, e'y-ne of g ubiq! 
u t,n to specific lysines of its substrate protein [48 49] 
Ubiqumation is the first step in intracellular breakdown 
of proteins In neurons, it might either precede or follow 
at ack by the apoptotic cysteine proteinase, but again the 
relevance of this reaction to HD is still unclear. 

Bao et al [SO] have found that huntingtin is retained 
on calmodulin Sepharose and that such Sepharosc retains 
more mutant than normal protein from brain extracts 
of HD patients. The physiological relevance of this 
observation is difficult to judge, because the preparation 
or huntingtin included purification on an SDS-PAGE 
gradient which would have denatured the protein. 

Structure and function of glutamine repeats 

ay a strange accident, my attention was drawn to 
glutamme repeats before the publication of the gene for 
Huntington s disease. Such repeats have been found for 
instance ,n some homeodomain proteins of Drosophila. A 
survey ofthe Swiss Prot Data Bank showed that 33 out 
or 40 proteins with 20 or more glutamines in a row are 
transcription factors, many of them in Drosophila, involved 
mostly m the developmental regulation especially of the 
nervous system [51.52]. Wondering what the structure of 
glutaminc repeats might be, I built an atomic model which 
showed that p-strands of poly-L-glutamine could be linked 
together by hydrogen bonds between both their main 



chain and side chain amides (Fig. 5). In other words, thev 
acted as polar zippers, which made me wonder if thev 
attached p-strands of proteins to each other, while leucine 
zippers had evolved to make a-helices stick together [53]. 

Figure 5 
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a synthetic polypeptide and then a protein. Poly-L-glu- 
tamine alone is insoluble in water. To render it soluble, 
my colleague Tony Johnson synthesized the polypeptide 
Asp2-Gln| 5 -Lys 2 . This formed large aggregates which 
circular dichroism, electron-, and X-ray diffraction proved 
to consist of p-plcated sheets, consistent with my polar 
:ippcr idea, even though isolated glutamines in proteins 
tend to favour ct-helix formation [54]. 

It remained to find out whether incorporation of glutamine 
repeats into a small monomeric protein of known structure 
would make it associate into oligomers. On AR Fersht's 
advice, I chose chymotrypsin inhibitor-2 (CI2) from barley 
seeds, a small protein with an external loop of nine 
residues carrying the inhibitory methionine. Kelvin Stott, 
Jonathan Blackburn and I engineered two constructs of 
CI2 from which the first 20 disordered residues had 
been deleted. In one construct, we replaced the loop 
by the sequence Gly-Gln 10 -Gly, and in the other we 
inserted that sequence into the loop (Fig. 6). The two 
constructs, together with the truncated wild-type protein, 
were expressed in good yield in £. coll Fractionation of the 
cell extracts by fast protein liquid chromatography yielded 
the truncated wild type as a single fraction, while each of 
the* constructs yielded two additional fractions of higher 
molecular weight. Analytical ultra-centrifugation showed 
that the wild type fraction contained only monomers, 
whereas the additional fractions contained in the two 
constructs represented dimers and trimers, as predicted if 
the glutamine repeats acted as polar zippers [55]. 

It seemed important to determine the dissociation con- 
stants of the CI2 oligomers, but they proved stable up to 
70*C, above which they denatured. Nor did they dissociate 
in urea at concentrations below 3M; at higher urea 
concentrations they denatured cooperatively with their 
glutamine repeats. Suspecting that this was a consequence 
of the large number of hydrogen bonds between the row 
of ten glutamines, Teca Galvao made constructs of the 
inhibitor with only 8, 6 or 4 glutamines inserted into its 
loop. Contrary to expectation, reduction of the number of 
glutamines only reduced the yields of dimers and trimers, 
but not their stability, which suggested that the glutamines 
made these protein molecules interlock in some manner 
while they were folding in the £. colt cell. To test this idea, 
Stott denatured a concentrated solution of the monomers 
in 4M guanidinium-HCI and then allowed them to refold 
by dialyzing the solution against water. The result was a 
mixture of monomers, dimers and trimers, as found in the 
E. coii extract, which confirmed that these oligomers do 
indeed form during folding of the protein. 

Scott now began to wonder whether the formation of 
oligomers was specific for glutamines, or whether it 
was a consequence simply of our having enlarged the 
external loop and would occur also if other amino acid 
residues were inserted into it. He therefore engineered 
the trypsin inhibitor with an insert of ten glycines and 



Figure 6 



(a) Structure of wild-type CI2; the first 18 residues are disordered 
and are not shown [7]. Loop residues 54-62 are represented by 
spheres. The G!y-Gln 10 -Gfy peptide was inserted into the loop 
at Met59 in one mutant, while the second mutant was generated 
by replacing residues 54-61 of the loop with this peptide, (b) 
Amino acid sequence of the truncated wild-type CI2 and of the two 
mutants. Original residues of the loop are underlined. Reproduced 
with permission from [55]. 



serines instead of glutamines. This construct did form 
dimers and trimers, but in the reversible denaturation 
test the ratios of dimers/monomers and trimcrs/monomers 
formed were respectively one third and one ninth of those 
with glutamine repeats. In another test with an insert 
of alanines and serines engineered by Pctra Scamborova 
these ratios were only slightly lower than those with glu- 
tamine repeats, showing that van der Waal's interactions 
between the methyl group of the alanines and/or hydrogen 
bonds between the y-hydroxyls of serine could also 
stabilize interactions between the CI2s. In each construct 
association was again irreversible short of denaturing the 
proteins, showing that the oligomers formed by some 
manner of interlocking between monomers during folding. 
Formation of irreversibly linked oligomers may be due 
either to domain-swapping, as found in several proteins 
[56], or more likely to strand exchange, as found in 
ct r antitrypsin where an external loop of one molecule 
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inserts itself into the P-sheet of a neighbouring molecule 
[57]. The answer awaits structural analysis. 

As far as the diseases are concerned, the most obvious 
inference would be that the glutamine repeats cause 
self-association into large protein aggregates, but immuno- 
staining of the degenerate neurons in post-mortems of HD 
patients has shown no evidence for these. Alternatively, 
association with other proteins may poison the cells. 

I now come to the most remarkable property of proteins 
with glutamine repeats, namely the toxicity of repeats with 
more than 41 consecutive glutamines, not only to certain 
neuronal cells, but even to E. colt and some mammalian 
cells in culture. This abrupt transition to toxicity suggests 
that elongation beyond 41 is associated with a phase 
change, i.e. a transition between two distinct structures. 
Isolation of a monoclonal antibody which recognises 
specifically expanded glutamine repeats in huntingtin and 
in the proteins of SCA1 and 3 also speaks in favour 
of a phase change, i.e. a structural transition caused by 
expansion [58* # ]. Thermodynamic considerations suggest 
what this transition might be. 

A preliminary NMR study of the structure of CI2 with an 
insert of ten glutamines showed them to have a random 
structure. In a random coil, the high entropy of the 
glutamine repeat itself would have a stabilizing effect; on 
the other hand, its free amides would bind and immobilize 
water; this would diminish the entropy of the system 
and therefore its stability. On the other hand, in a rigid 
hairpin with hydrogen bonds between all the amides, such 
as shown in Figure 5, the entropy of the polyglutamine 
repeat itself would be low, but the liberation of water 
molecules would compensate for this by increasing the 
entropy of the system of protein plus water. These two 
effects may balance each other up to a critical length 
when another effect would enter. This is known as the 
chelate effect and is a major stabilizing factor of protein 
and nucleic acid structures. 

With each pair of glutamines joined together to form a 
hairpin, the loss of translational and rotational entropy 
suffered on joining the next pair of glutamines would 
be lessened. At a critical length, that loss may become 
negligibly small and the system would then become 
strongly stabilised by the gain in entropy due to the 
liberated water molecules. Hairpins may therefore become 
dominant when the number of glutamines exceeds 41. 
These ideas will now have to be tested with suitable 
protein models. Hairpins may insert themselves into other 
protein molecules of cither the same or a different kind 
and cause interactions which poison the cell, perhaps by 
the formation of oligomers during protein folding. 

It seems that the formation of oligomers by insertion of 
the P-strand of one molecule into the p-sheet of another 



molecule during their partial unfolding and refolding 
via a Molten globule 1 state may cause other diseases. 
The polymerisation of prions responsible for scrapie, and 
by implication for bovine spongiform encephalitis and 
Jakob-Creutzfeld disease, is accompanied by a transition 
from a mainly oc-helical to a P-sheet structure, induced 
by infectious prions with a p-sheet structure. A fatal, 
human, familial amyloidosis was found to be caused by the 
aggregation of soluble, partly oc-hclical mutant lysozyme 
molecules into insoluble fibrils with a P-sheet structure 
([59] and . M Sunde, P Pepys, CM Dobson, personal 
communication). Our observations may therefore lead us 
to a better understanding of the molecular pathology also 
of amyloid diseases. 
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Abstract 

Anticipation describes an inheritance pat- 
tern within a pedigree with an increase in 
disease severity or decrease in age at onset 
or both in successive generations. The 
phenomenon of anticipation has recently 
been shown to be correlated with the ex- 
pansion of trinucleotide repeat sequences 
in different disorders. We have studied 
differences of age at onset and disease 
severity between two generations in 14 
families with unilinear inheritance of bi- 
polar affective disorder (BPAD). 

There was a significant difference in age 
at onset (p<0*008), in episodes per year 
with (p<0-006) and without (p<0*03) lith- 
ium treatment, and in total episodes per 
year (p<0*002) between generations I and 
II. Furthermore, there was a highly sig- 
nificant correlation (p<0-001) in age at 
onset between generations I and II. No 
evidence for specific paternal or maternal 
inheritance was found. 

We found evidence of anticipation and 
could rule out ascertainment bias or some 
other artefact. Anticipation is thus an in- 
heritance pattern in BPAD which suggests 
that the expansion of trinucleotide repeat 
sequences is a possible mode of in- 
heritance in BPAD. 

(J Med Genet 1994;31:686-689) 



Bipolar affective disorder (BPAD) is a severe 
neuropsychiatric disorder with a prevalence of 
about 1% in the population. Previous fam- 
ily, 12 twin, 3 and adoption studies 4 support the 
hypothesis that genetic transmission is a major 
aetiological factor in the disease. Linkage ana- 
lysis has suggested gene localisation to chro- 
mosome 1 1 for BPAD, 5 but further studies have 
not been able to confirm these results and the 
results have been inconsistent. 6 " 8 The mode of 
inheritance of BPAD is still unknown but the 
recent demonstration of anticipation in BPAD 
opens up new possibilities for studying the 
genetic transmission of this disorder. 9 

Anticipation describes an inheritance pattern 
within a pedigree with an increase in disease 
severity or decrease in age at onset or both 
in successive generations. 1011 In early clinical 
studies anticipation was suggested as an in- 
heritance model of psychiatric disorders. 12 " 14 
Anticipation has been found in myotonic dys- 
trophy, 10 15 fragile X syndrome, 16 Huntington's 
disease, 17 " 20 amyloidosis, 21 and BPAD. 9 Anti- 
cipation has been shown to be directly associ- 



ated with the expansion of trinucleotide repeats 
at the disease locus in fragile X syndrome," 2 
myotonic dystrophy, 23 24 and Huntington's 
disease. 1 '" 20 

In this study we have examined 13 two gen- 
eration unilinear BPAD family pairs and one 
BPAD family affected in three successive gen- 
erations, to look for evidence of anticipation. 
Our results are in agreement with anticipation 
as an inheritance pattern in pedigrees of BPAD, 
resulting in an increase in disease severity and 
a decrease in age at onset in successive gen- 
erations. 



Material and methods 
BPAD patients were recruited from the lithium 
dispensaries at the Psychiatric Clinics of Umea 
and Harnosand Hospitals in northern Sweden. 
Lithium therapy is used as a prophylaxis in 
BPAD patients, aiming to reduce the frequency 
and severity of depressive/manic episodes. Its 
feasibility depends on compliance from the 
patient. From the register we found that about 
25% of these patients fulfilled the DSM-IIIR 
criteria of BPAD (types I and II). The patient 
files were checked by OPCRIT, 25 which further" 
confirmed the diagnosis. 

Information about the occurrence of psy- 
chiatric morbidity in the relatives of these pa- 
tients was obtained from medical records. In 
the present study we included only pedigrees 
with unilinear inheritance of BPAD. Cases of 
BPAD with other diagnoses in their pedigrees 
such as schizophrenia, schizoaffective syn- 
drome, unipolar major depression, or unipolar 
mania were excluded from the study. Al- 
together we found 14 pedigrees with unilinear 
inheritance of BPAD, one of these showing 
three generation inheritance. In five of these 
pedigrees, one parent and two children each 
had BPAD. 

Anticipation was measured as (1) age at onset 
and (2) frequency of episodes with and without 
lithium treatment. The age at onset was defined 
by the first episode of either depression or 
mania. Episode frequency was used as a meas- 
ure of disease severity and its overall value was 
calculated by the total number of episodes 
of depression and mania divided by the total 
number of years from age at onset to death or 
to the time of investigation. For each person, 
information was also obtained regarding the 
episode frequency and the length of time with 
or without lithium treatment, respectively. This 
enabled separate estimates of episode fre- 
quency either during lithium treatment or when 
there was no lithium treatment. 
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ternal or maternal inheritance and for the 
correlation analyses, however, pairwise com- 
parisons were made within parent-offspring 
pairs. Each of the five families with two BPAD 
children were treated as two parent-offspring 
pairs. Statistical significance was tested by the 
-Wilcoxon rank sum test for pair observations. 
Correlation analyses were performed by Spear- 
man correlation coefficients. Two tailed p val- 
ues were used for all tests. 

Results 

The family data including sex, generation, year 
of birth, age at onset, total episodes per year, 
and episodes per year with and without lithium 
treatment are shown in table 1 . 

In generation I (n= 15) the range for birth 
year was 1 892- 1 940 and for age at onset 20-52 
years. The corresponding figures for generation 
II (n = 20) were 1913-1974 and 14-44 years, 
respectively. The mean age at onset was 35-1 
(SD 10-3) in generation I and 25 0 (SD 8*4) in 
generation II. There was a significant difference 
(p<0 008) in age at onset (101 years) between 
generations I and II. 

The number of episodes per year in gen- 
erations I and II, with and without lithium 
treatment, can be seen in table 2. A significantly 
higher frequency (p<0002) of total episodes 
per year was found in generation II (0-78, 
SDO-58) compared with generation I (0-33, 
SD018). There were also significant differ- 
ences in episodes per year with (p<0 006) and 
without (p<0 03) lithium treatment between 
generations I and II, with the highest fre- 
quencies in generation II. 

The distribution of the age at onset (AO) for 
the 20 parent-offspring pairs can be seen in fig 
1. Anticipation (age at onset in parent minus 



Table 2 Episodes per year in generations I and II with and without hthtum treatment 
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Figure I Hie differences in age at onset (AO) between the 20 family pairs with BB4D. 
P= parent, 0~ offspring. 

STATISTICAL ANALYSIS 

The three generation family was broken up 
into two generation pairs, so the person in the 
middle generation appeared once as a parent 
and once as a child. For comparing generation 
I and generation II, the Wilcoxon rank sum 
test for independent samples was used to test 
for statistical significance. To investigate pa- 
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offspring) of 20 cases with BPAD. A nonnal distribution curve is plotted. 
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Table 3 Paternal and maternal inheritance of bipolar affective disorder: 



Generation 
pair 



No 



Age at onset 



age at onset 
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Offspring 
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Father-son 
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Range (y) 


Mean (SD) 


Diff 
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age at onset in offspring) was distributed as 
shown in fig 2. The plot of anticipation against 
parental age at onset is shown in fig 3, with 
the regression line showing a positive intercept. 

There was a highly significant correlation 
(r = 0*71, p<0001) for age at onset between 
generations I and II, which can be seen in fig 
4. There was also a significant correlation for 
age at onset between father and offspring (r = 
0-94, p<0*001). No such significant correlation 
was found between mother and offspring and 
between mother and daughter, respectively. 
When studying the inheritance from father to 
son there was a highly significant correlation 
(r = 0*97, p<0 001, n = 7) for age at onset be- 
tween generations I and II. 

Maternal and paternal inheritance can be 
seen in tables 3 and 4. There were significant 
differences in total episodes per year and age 
at onset between father and offspring (p<0*05, 
p<0008), mother and offspring (p<0005, 
p<0005), and mother and daughter (p<003, 
p<002). There was also a significant differ- 
ence in age at onset between father and son 
(p<002). 

To search for a possible cohort effect con- 
tributing to the difference in age at onset within 
parent-offspring pairs, we calculated the cor- 
relation between the year of birth difference 
and the onset age difference within the pairs. 
We found the correlation to be non-significant 
(r = 0*01, p = 0-97, n = 20), indicating an ab- 
sence of cohort effect. 



Discussion 

Our results support the occurrence of an- 
ticipation in families with BPAD. We found 
evidence for both a decrease in age at onset 
and an increase in severity of the disease in 
successive generations. 

In this study we have exclusively selected 
families with unilinear inheritance of BPAD in 
order to avoid the possibility of transmission 
of genetic material responsible for the disease 
from both parents. At present it is not possible 
to exclude the possibility that the BPAD could 
be a heterogeneous group containing more than 
one disorder and more than one mode of in- 
heritance. We have also excluded pedigrees 
containing diagnoses such as schizophrenia, 
schizoaffective disorder, unipolar depression, 
and mania for the same reasons. This gives us 
a fair opportunity to study anticipation as a 
mode of inheritance in BPAD. 

In a recent study of BPAD, the disease had 
8-9-13*5 years earlier age at onset and was 
1*8-3-4 times more severe in generation II 
compared with generation I. 9 In our study the 
disease had about 10 years earlier age at onset 
and was about twice as severe in the second 
generation, which is in good agreement with 
previous results. 

Generation II showed a significandy higher 
(p<0*03) frequency of episodes per year during 
the period without lithium treatment. A sig- 
nificantly higher frequency (p<0006) of epis- 
odes per year was also found during treatment 
with lithium. Therefore, we suggest that lithium 
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which is also an indication that the disease is 
more severe in the next generation. 

Interestingly, we found a highly significant 
correlation (p<0-001) between age at onset in 
generation I compared to II, which further 
supports a genetic mode of transmission in 
BPAD. We also found evidence of only positive 
anticipation (fig 3). These results suggest the 
occurrence of anticipation rather than as- 
certainment bias or some other artefact. 26 

The phenomenon of imprinting, with pre- 
ferential expression of the maternally or 
paternally inherited allele, is known to affect 
gene expression and development. In this study 
we found that paternal or maternal trans- 
mission of genetic material influenced the age 
at onset and severity of the disease in the second 
generation equally. However, the correlation 
coefficients for age at onset between father and 
offspring was higher than that between mother 
and offspring. 

Sporadic cases are often found in many psy- 
chiatric diseases such as schizophrenia, schizo- 
affective disorder, bipolar affective disorder, 
unipolar depression, and mania. These cases 
are usually called non-familial with no obvious 
hereditary background in the family. If the same 
phenomenon with the decreasing of tri- 
nucleotide repeats between generations (as re- 
cently shown for myotonic dystrophy 27 ) is 
occurring in BPAD, followed by an expansion, 
this could be an explanation for the occurrence 
of some of the sporadic cases. 

Many pedigrees show a heterogeneous pat- 
tern with different diagnoses within the same 
pedigree. Lately, the question has again been 
raised whether schizophrenia and affective dis- 
order are genetically associated. 2 " An in- 
heritance model with a differential expansion 
of trinucleotide repeats in the same locus for 
different persons may turn out to be an im- 
portant candidate to explain the observed het- 
erogeneity. 
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Summary 

Anticipation, or increasing severity of a disorder across successive generations, is a genetic phenomenon with an 
identified molecular mechanism: expansion of unstable trinucleotide repeat sequences. This study examined 
anticipation in familial schizophrenia. Three generations of siblines from the affected side of families selected 
for unilineal, autosomal dominant-like inheritance of schizophrenia were studied (n = 186), Across generations 
more subjects were hospitalized with psychotic illness (/><.0001), at progressively earlier ages (P<.0001), and 
with increasing severity of illness (/><.0003). The results indicate that anticipation is present in familial 
schizophrenia. These findings support both an active search for unstable trinucleotide repeat sequences in 
schizophrenia and reconsideration of the genetic model used for linkage studies in this disorder. 



Introduction 

The clinical observation of anticipation — i.e., inherited 
illness that becomes more severe across successive gen- 
erations—has recently been found to have a molecular 
basis: expanding GC-rich trinucleotide repeat sequence 
mutations (Harper et al. 1992; Sutherland and Richards 

1992) . In fragile X syndrome (Verkerk et al. 1991), 
myotonic dystrophy (Fu et al. 1992), spinobulbar mus- 
cular atrophy (Brook et al. 1992), spinocerebellar atro- 
phy type 1 (Orr et al. 1993), and Huntington disease 
(Huntington's Disease Collaborative Research Group 

1993) , increasing severity of illness, earlier age at onset, 
and/or increasing proportion of ill individuals in suc- 
cessive generations are associated with longer trinucleo- 
tide expansions. Schizophrenia is another neuropsychi- 
atric disorder that may display this anticipation 
phenomenon and that therefore may have familial 
forms caused by an unstable trinucleotide repeat. 

Schizophrenia is a severe disorder characterized by 
social withdrawal and psychotic symptoms, such as de- 
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Iusions and hallucinations. The illness has a variable age 
at onset, often beginning in early adulthood and result- 
ing in lifelong disabilities in social and occupational 
functioning. Evidence from family, twin, and adoption 
studies, including those using reliable diagnostic crite- 
ria (Lowing et ah 1983; Kendler et al. 1985), strongly 
supports a genetic etiology for schizophrenia (Gottes- , 
man and Shields 1982). However, the mode of inheri- 
tance for schizophrenia is not readily identifiable and is 
proposed to involve interacting genes (Risch 1990). In 
families with the illness, reduced penetrance and vari- 
able expression are commonly found. Other psychotic 
disorders, of lesser severity, and schizotypal personality 
traits such as social isolation, odd communication, and 
extreme suspiciousness are conditions likely reflecting 
variable expression of genetic susceptibility to schizo- 
phrenia (Gottesman and Shields 1982; Lowing et al. 
1983; Kendler et al. 1985). These factors, along with the 
possibility of genetic heterogeneity and the practical 
difficulties of studying an illness with a significant sui- 
cide rate and suspicious, socially isolated individuals, 
combine to make schizophrenia a challenging disorder 
for linkage studies (Bassett 1991). Strategies to over- 
come these difficulties include focusing on familial 
schizophrenia where inheritance is consistent with 
Mendelian patterns, using reliable diagnostic methods, 
highly polymorphic DNA markers, and lod-score 
methods that model the complexities of the inheri- 
tance. Linkage studies to date, using informative fami- 
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lies and models based on Mendelian inheritance, have 
yielded significant lod scores but no replicated positive 
results (Bassett 1991). 

Dynamic modifications of classical patterns of ge- 
netic transmission, such as anticipation (Mott 1911) 
and genomic imprinting (differential expression of ge- 
netic material depending on parental origin of the gene 
and the underlying molecular mechanisms), may ex- 
plain the complex genetics of schizophrenia and other 
major mental illnesses (Mclnnis et al. 1991; Hint 1992). 
For example, trinucleotide repeats can cause reduced 
penetrance and variable expression Y^askey et al. 1992; 
Sutherland and Richards 1992), by existing in a premu- 
tation form, by reductions in repeat size, or by somatic 
mutation in early embryogenesis (Lavedan et al. 1993). 
The unstable nature of trinucleotide repeats also pro- 
vides a possible mechanism for the high mutation rates 
proposed for schizophreniaysiater and Cowie 1971). If 
there were evidence of anticipation in schizophrenia, 
screening for triplet repeat mutations would become a 
rational option for gene localization studies, and modi- 
fication of the genetic model used in linkage studies 
would need to be considered. The current study inves- 
tigated whether anticipation was present in a familial 
schizophrenia sample participating in a linkage study. 

Subjects and Methods 

Subjects were members of eight extended noncon- 
sanguineous families participating in a genetic linkage 
study of familial schizophrenia. Local psychiatrists 
identified prospective pedigrees segregating schizophre- 
nia. Families were selected for large size, availability of 
two or more generations of adults, and apparent unilin- 
eal, autosomal dominant-like inheritance of schizo- 
phrenia and genetically related disorders. Bilineal fami- 
lies with evidence, from family or collateral history, of 
schizophrenia or other nonaffective psychotic dis- 
orders on both sides were excluded. Further details of 
the original ascertainment and assessment for the link- 
age study are described elsewhere (Bassett et al. 1993). 
Since families were ascertained in their entirety, pro- 
band status could be assigned to all affected subjects. 
Therefore no subjects were excluded from analyses. 

Subjects only from the affected side of each family 
were taken into account, to determine sibling sets (sib- 
lines) in the index generation (IG) (n = 13 siblines), 
parental generation (PG) (n = 10 siblines), and grand- 
parental generation (GG) (w = 8 siblines). The affected 
side was defined by (1) a parent hospitalized with psy- 
chosis (four cases), (2) an aunt/uncle hospitalized with 



psychosis (four cases), (3) a parent/aunt/uncle with 
schizotypal traits (seven cases), or (4) a sibship linking 
two affected nuclear families (six cases in two extended 
families) (see fig. 1). In two cases in the GG, the affected 
side could not be determined, and the smaller of the 
unaffected maternal or paternal siblines was arbitrarily 
selected. There were three instances of unknown pater- 
nity. In two of these cases, the maternal line was af- 
fected. In the third case, neither the mother nor her five 
siblings were affected, and this GG sibline was not in- 
cluded in the analysis. 

Family-history information was obtained for each 
subject from three or more family members by using 
the Family History-Research Diagnostic Criteria (FH- 
RDC) method (Andreasen et al. 1977). Genealogical re- 
cords were used to confirm dates of birth and death. 
Medical records were searched back to 1866 for evi- 
dence of psychiatric hospitalization. Because the sub- 
ject families originated and seldom moved from a cir- 
cumscribed region of Canada, and because the one 
psychiatric hospital available until the 1980s consis- 
tently maintained a comprehensive file-card system of 
recording admissions, virtually complete ascertainment 
of psychiatric hospitalization was assured. Records 
were collected for all subjects with a history of psychiat- 
ric hospitalization. Living subjects participating in the 
linkage study were directly interviewed by a psychiatrist 
(Bassett eral. 1993). Diagnostic folders containing the 
family history and, if present, medical records and in- 
terview data were reviewed independently by two psy- 
chiatrists (A.S.B. and W.G.H.), one of whom (W.G.H.) 
was blind to the pedigree structures. A consensus life- 
time Research Diagnostic Criteria (RDC) diagnosis for 
psychotic disorders, age at first hospitalization for a 
psychotic illness, and presence of two or more RDC 
schizotypal traits were recorded. Psychotic disorders 
included schizophrenia {n = 25), schizoaffective dis- 
order (n = 13; 12 mainly schizophrenic type and 1 
other), unspecified functional psychosis (n = 4), mania 
with psychosis (n - 1), and depression with psychosis [n 
= 1). Schizophrenia and schizoaffective disorder were 
approximately equivalent in severity in the current sam- 
ple (Bassett et al. 1993) and were considered together in 
the current study. Individuals with psychotic disorders 
not severe enough to require hospitalization, as well as 
subjects with schizotypal traits, were combined in a 
single schizotypal group, because of small numbers in 
each category. 

Of the 209 subjects in the affected siblines, 23 were 
excluded from the analyses. One IG subject had not 
attained the age of 15 years, considered a minimum age 
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Figure I Three pedigrees of theeightfam- 
ihes studied, illustrating anticipation in familial 
schizophrenia. IG, PG, and GG siblines on the 
affected side are shown. The numbers below indi- 
viduals indicate their age at first hospitalization 
for psychotic illness. An unblackcned square (Q) 
denotes an unaffected male; an unblackened cir- 
cle (O) denotes an unaffected female; a black- 
ened square (■) or circle (•) denotes hospitaliza- 
tion for a psychotic disorder; and a 
half-blackened square (TJ) or circle (3) denotes 
schizotypal conditions. Sex and birth order of 
some individuals have been changed to protect 
confidentiality. A slash (/) through the symbol 
denotes that the individual is deceased. The box 
outlines a single affected lineage from a family 
connected at the grandparental level. 
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of nslc for psychotic illness (Gottcsman and Shields 
1982). Five subjects (2 PG and 3 GG) had moved, and 
collateral information was insufficient to determine 
hospitalization status; and 17 subjects (3 IG, J PG, and 
9 GG) died before the age of 40 years. Thirteen died in 
infancy or childhood, two in war, one in an accident at 
work, and one of unknown cause; none were suicides 
Most new cases of schizophrenia may be expected be- 
fore age 40 years (Gottesman and Shields 1982). Data 
on the remaining 186 subjects were examined for antici- 
pation, in three ways. First, the rates of hospitalization 
for psychotic disorders and the rates of schizotypal 
conditions were compared across generations by using 
X analyses. Second, to assess severity of illness, subjects 
were assigned the following ratings: hospitalized with 
schizophrenia or schizoaffective disorders— 3; hospital- 
ized with other psychotic disorders— 2; schizotypal— 
1; and unaffected-0. Means for each generation were 
compared using the one-way analysis of variance (AN- 
OVA), including correction for multiple tests of signifi- 
cance with the Student-Newman-Keuls procedure. 
Third, age at first hospitalization for psychosis was as- 
sessed using the life-table method of survival analysis 
for 1-year intervals. Homogeneity of survival curves 
over the generations was examined using the Wilcoxon 



test. The analysis was performed assuming (1) no differ- 
ential mortality between affected and unaffected and 
(2) hospitalization rates independent of chronological ' 
time (e.g., 1920 vs. 1970). Observations ended at the 
subject's current age or age at death. Covariates tested 
were sex and transmission patterns (matemal/patemal). 

Results 

Demographic characteristics of the sample and re- 
sults indicating anticipation are presented in table 1. As 
for other illnesses demonstrating anticipation, expres- 
sion of ,llness varied between members of a sibship 
(fig. 1). v 

Rates and Distribution of Illness 

There were significantly more subjects hospitalized 
tor psychosis across successive generations ft 2 = 16.84, 
P<.0001, 2 df). Most had schizophrenia or schizoaffec- 
tive disorders (IG, n = 30; PG, n = 8; and GG, n = 0). Of 
the six subjects with less severe disorders-unspecified 
functional psychosis (» = 4), psychotic mania (» = 1), or 
psychotic depression (n = l)-four were in PG or GG. 
Subjects with the least severe illnesses (schizotypal con- 
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Table I 

Characteristics of the Sample, by Generation (n = 186) 



IC PC GG 



Total no. of subjects (females) 8 $ / 38) 61 (24) „ ,- n 

""U 

Mean age of hv,n£ SD 4Wl (g74) 65J5 

Mean age at death' (SD, ^ ' ^ ^ 

Mean age at first hospitalization (SD) 26.16 (128) 34.00 (17.28) 41 JJ (15.89) 

Subjects hospitalized for psychotic illness .... 32 (3721%) 9 (14 .52%) 3 (7.89%) 

Subjects with schizotypal conditions g (9.30%) 8 (12.90% ) 8(21.05%) 

* No. of living subjects: IG, 82; PG f 42; and GG, 10. 

' d ?t SUbiCCt i ^ 38C 31 dcalh: IG * 4; PG ' 2 * and GG » 20 - Ei S"< other GG subjects had 
ages at death that were less precisely known, e.g., «in their 70s." The mean shown did not change signifi- 
cantly when these subjects were included using estimated ages at death 
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ditions) were twice as common in the GG as in the IG 
(21.05% vs. 9.30%); however, across the three genera- 
tions, the result was not significant (x 2 = 3.24, P = .20, 
2 df). When these subjects with schizotypal conditions 
were included with hospitalized subjects, there were 
still significantly more affected subjects in the youngest 
generation ft 2 = 6.86, P = .032, 2 df). 

Because of the possibility of compounded error with 
families being connected at the parental or grandparen- 
tal level, analyses were rerun with a single affected lin- 
eage from each of the seven kindreds that had data 
from all three generations, with the largest siblines be- 
ing selected (see fig. 1). The same results were found 
using this subsample of 151 subjects, for both hospital- 
ization rates & 2 = 12.30, P<.002, 2 df) and hospitaliza- 
tion plus schizotypal rates (x 2 = 9.46, P<.009, 2 df). 
Excluding subjects who died before age 40 or moved 
away (n = 22) could have influenced the study's find- 
ings, since most were in the two senior generations. 
When results with these subjects and with penetrance 
estimated maximally at 100% were considered, so that 
one-half (n = 11) would have been hospitalized with 
psychotic disorders, the results for hospitalization rates 
would still have remained significant (x 2 = 10.02, 
P<.01, 2 df). Data were also reanalyzed examining the 
effect that increasing hospitalization rates over time 
would have on the results. Secular trends of hospitaliza- 
tion rates for specific illnesses were not available. There- 
fore, arbitrary increases were tested, by adding 100% 
more subjects (n = 3) to the hospitalized GG group and 
50% more (n - 4.5) to the PG group. Under these con- 
ditions, a significant increase in the rate of psychosis 
requiring hospitalization would continue to be present 
oyer the generations (% 2 = 7.75, P<.05, 2 df). 



Severity of Illness 

Means for four-point severiry-of-illness ratings for 
the three generations were as follows: IG (n = 86), 1.19; 
PG (n « 62), 0.58; and GG (n = 38), 0.37. Severity of 
illness significantly increased over the generations (F 
= 8.39, 2 df, P = .0003). Pairwise comparisons using 
the Student-Newman-Keuls test revealed that the signif- 
icant difference was between the IG and the PG and 
GG. PG and GG severity means were not significantly 
different from each other. 

Age at First Hospitalization 

The survival curve (fig. 2) shows that subjects were 
first hospitalized for psychosis at progressively younger 
ages, across generations (z 2 = 26.76, P = .0001, 2 df). 
Sex of subject was not a significant covariate to genera- 
tion, for age at first hospitalization (increment x 2 
= 0.40, P » .53, 1 df); All of the GG and most (68%) of 
the PG subjects had achieved an age of 55 years or 
more, by which time they would have been virtually 
through the age at risk for schizophrenia (Gottesman 
and Shields 1982). 

Imprinting 

There were equal rates of maternal and paternal 
transmission (six and seven cases, respectively) from the 
PG to the IG. The mean age at first hospitalization for 
maternal transmission (PG to IG) was 25.00 years (SD 
5.94 years), and that for paternal transmission was 
27.64 years (SD 10.63 years), a nonsignificant difference 
(* « .83, P = .41). Grandparental to parental transmis- 
sion was predominantly maternal (seven of nine cases), 
with one unknown and only one example of paternal 
transmission. Sex of transmitting parent was not a sig- 
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Age (years) 



Figure 2 Survival curves for age ar first hospitalization for 
psychotic illness, comparing IG (A), PC (•), and GG (■). Annual 
percent sumving without hospitalization is plotted for every Jth 
year. Observation of individuals ended at their age of death or, L,ey 
were living, at their current age. ^ 



nificant covariare in the survival analysis examining age 
at first hospitalization (increment Y 2 = 0 32 P = <7 
2df). ' 

Oiscussion 

The results suggest that familial schizophrenia ex- 
hibits anticipation. All of the families studied showed 
this phenomenon, manifest as increasing rates of hospi- 
talized psychotic illness, worsening severity of illness 
and/or earlier age at onset, across successive genera- 
tions (fig. 1). These findings are consistent with differ- 
ences in rates of hospitalization and age-at-onset data 
for parent-child pairs in studies of schizophrenia over 
the century (Mott 1910; Kay 1963; Penrose 1971- De- 
cinaetal. 1991). In each of these studies, rates of hospi- 
talization for psychosis were lower for antecedent gen- 
erations, and age at onset for parents was significantly 
later than that for offspring. Investigations of ancestors 
and extended families also support these findings 
(Karlsson 1966; Odegaard 1972; Wetterberg and 
Farmer 1991). As well, less severe psychotic illnesses 
(eg, affective disorders) are consistently more com- 
mon in the generation antecedent to schizophrenic 
probands (Slater and Cowie 1971; Bleuler 1978) These 
results complement reported morbid risk of schizophre- 
nia for parents, which is almost half that for siblings 
(Gottesman and Shields 1982). While alternative rea- 
sons, including selection biases such as reduced fertility 
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•n earlier-onset schizophrenia, have been proposed to 
explain these clinical observations, they are all consis- 
tent with the phenomenon of anticipation in schizo- 
phrenia. 

In contrast, findings from the current study are only 
suggestive of sex-specific differences in the transmis- 
sion of schizophrenia. However, the possibility of an 
excess maternal over paternal transmission in schizo- 
phrenia is consistent with trends found both recently 
by others (Sharma et al. 1993) and in studies of large 
data sets m the older literature (Penrose 1971- Slater 
and Cowie 1971). Fertility may be especially low in 

S. M P fIo e " tS With schiz °Phrenia (Gottesman and 
Shields 1982), and in the current study this could be the 

, r the high rate of maternal transmission from 
the GG lines to the PG lines. If this were the case 
however one would have expected predominantly ma- 
ternal inheritance from the PG to the IG lines; but ma- 
ternal and paternal rates were equal. Comparable trans- 
mission patterns have been found to be due to greater 
variation of trinucleotide repeat length after female 
meiosK in myotonic dystrophy (Lavedan et al. 1993) 
Jhe effect may be subtle, requiring larger samples to 
demonstrate imprinting in schizophrenia. 

There are several possible biases that can explain re- 
sults that indicate anticipation (Penrose 1948). First 
subjects could have died before expressing the muta- 
tion. However, all of the GG and most of the PG lived 
beyond age 55 years. Also, significantly different hospi- • 
talization rates across generations remained in the 
current study, even when half of those who had moved 
or died before age 40 years were assigned affected sta- 
tus. Second, reduced fertility of individuals with earlier 
onset of schizophrenia could cause preferential ascer- 
tainment of parents with later onset. This bias should 
be minimized in the current study, because (1) few par- 
ents were affected with psychosis and (2) siblines were 
large in all three generations, providing multiple oppor- 
tunities for detection of affecteds, regardless of their 
fertility. Third, subjects in the IG could have been too 
young to yet express a late-onset form of psychosis, 
which would attenuate the age-at-onset findings. Since 
one-half of the subjects in the IG were over age 40 
years, most were beyond the period of highest risk, 
tven if more new cases of psychosis subsequently 
arose, this would most likely occur in the IG and would 
only serve to strengthen the present study's findings 
with respect to differential rates of illness. 

In contrast to major depression (Gershon eral. 1987) 
there is no evidence for a cohort effect in schizophre- 
nia. However, secular trends, such as improved detec- 
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tion, that could, over time, lead to higher hospitaliza- 
tion rates and/or younger age at first hospitalization 
for psychosis are important to consider, since these 
could have influenced the principal findings of the 
current study. In the literature (Mott 1910; Kay 1963; 
Penrose 1971; Decina et al. 1991), examination of ages 
at first hospitalization did not reveal secular trends to 
younger age over the century. In the current study, ar- 
bitrary increases in hospitalization incidence assigned 
to the PG or GG did.not change the observation of 
anticipation. Specific factors that may influence secular 
trends in hospitalization, including drug abuse and psy- 
chosocial stressors, do not appear to have played a role 
in the sample studied. Only three hospitalized subjects 
in the IG had a history of stimulant or hallucinogen use, 
drugs that may in some cases precipitate a psychotic 
illness. On the basis of direct interviews, it appeared 
that psychosocial stressors endured by PG and GG, 
such as the World Wars and Great Depression, were 
more severe than those faced by the IG. 

Another possibility is that the family-history method 
tends to underestimate rates of psychiatric disturbance, 
particularly in relatives who are dead or less known 
(Andreasen er al. 1977). The consequence could be that 
actual rates for schizotypal traits could be higher than 
those found, particularly for PG and GG. However, 
results for severe illness requiring hospitalization would' 
remain unchanged. Another factor that could have 
compounded errors in the PG and GG was the use of IG 
siblines connected at the parental or grandparental 
level. However, both the fact that results remained the 
same when only one ascending line from each kindred 
was examined and the fact that at least 13 affected 
siblines had resulted from eight originating GG lines 
support the finding of anticipation. 

A limitation that is important to consider in studies 
of common illnesses is the potential for assortative 
mating to cause an apparently increased prevalence of 
illness in offspring. This possibility was minimized by 
selecting unilineal pedigrees with no evidence of schizo- 
phrenia or related disorders in the married-in person, 
their siblings, or parents. Although individuals marrying* 
in could have been nonexpressing carriers of the dis- 
order with nonexpressing close relatives, the likelihood 
appears small that assortative mating could account for 
the results in the current study. Because the families 
studied were selected because of their autosomal domi- 
nant-like inheritance and large sibships, the results may 
not be generalizable to schizophrenia in the general pop- 
ulation, although they are consistent with observations 
from large population-based samples (Penrose 1971; 
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Bleuler 1978). Also, the schizophrenia in the subject 
families may be a particularly severe form, which could 
exaggerate the findings. However, the mean age at on- 
set for the IG is similar to others' results (Mott 1911; 
Gottesman and Shields 1982; Decina et al. 1991; 
Sharma et al. 1993). Data on specific symptom patterns 
suggest that the familial schizophrenia in the present 
families is comparable in nature and in severity to sam- 
ples drawn from the general population (Bassett et al. 
1993). 

Despite the possible biases and limitations, the 
weight of evidence from both the current investigation 
and the literature is consistent with the finding of antici- 
pation in familial schizophrenia. Other families with 
schizophrenia should be examined for anticipation and 
possible accompanying maternal imprinting phenom- 
ena, to confirm the current study's findings. Although 
published pedigrees consistently show evidence of an- 
ticipation (Karisson 1966; Wetterbeig and Farmer 
1991), complete ascertainment of other large kindreds 
with contemporary reliable diagnostic assessments 
would be useful. However, the most exciting possibili- 
ties—and the confirmation of the clinical observations 
of the current study— lie in the search for expanding 
trinucleotide repeats and other DNA sequence muta- 
tions in schizophrenia. New methods becoming avail- 
able to detect these mutations (Orr et al. 1993; Schal- 
ling et al. 1993) will complement and may accelerate the 
search for pathological genes in linkage studies. In ad- 
dition, the current study has implications for the ge- 
netic model used in linkage studies. Parameters, particu- 
larly penetrance, which would vary according to 
generation, may need to be modified to reflect the ef- 
fects of anticipation and possibly imprinting. In the 
light of clinical evidence for anticipation, these strate- 
gies represent real promise for deciphering the genetics 
of schizophrenia. 
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